Using Machine Learning Language Models to Generate Innovation Knowledge Graphs for Patent Mining

نویسندگان

چکیده

To explore and understand the state-of-the-art innovations in any given domain, researchers often need to study many domain patents synthesize their knowledge content. This provides a smart patent graph generation system, adopting machine learning (ML) natural language modeling approach, help grasp by generating deep graphs. research focuses on converting chemical utility patents, consisting of chemistries processes, into summarized The methods are two parts, i.e., visualization processes patents’ most relevant paragraphs domain-specific collection texts. ML algorithms, including ALBERT for text vectorization, Sentence-BERT sentence classification, KeyBERT keyword extraction, adopted. These models trained tested case using 879 carbon capture domain. results demonstrate that average retention rate summary graphs five clustered texts exceeds 80%. proposed approach is novel proven be reliable graphical representation.

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Machine Learning Models for Housing Prices Forecasting using Registration Data

This article has been compiled to identify the best model of housing price forecasting using machine learning methods with maximum accuracy and minimum error. Five important machine learning algorithms are used to predict housing prices, including Nearest Neighbor Regression Algorithm (KNNR), Support Vector Regression Algorithm (SVR), Random Forest Regression Algorithm (RFR), Extreme Gradient B...

متن کامل

Dust source mapping using satellite imagery and machine learning models

Predicting dust sources area and determining the affecting factors is necessary in order to prioritize management and practice deal with desertification due to wind erosion in arid areas. Therefore, this study aimed to evaluate the application of three machine learning models (including generalized linear model, artificial neural network, random forest) to predict the vulnerability of dust cent...

متن کامل

Patent mining: combining dictionary-based and machine-learning approaches

Exploration of the chemical patent space is essential for early-stage medicinal chemistry activities. The BioCreative CHEMDNER-patents task focuses on the recognition of chemical compounds in patents. This includes recognition of chemical named entities in patents (CEMP), classification of chemical-related patent titles and abstracts (CPD), and recognition of genes and proteins in patent abstra...

متن کامل

Using English Dictionaries to generate Commonsense Knowledge in Natural Language

This paper presents an approach to generating common sense knowledge written in raw English sentences. Instead of using public contributors to feed this source, this system chose to employ expert linguistics decisions by using definitions from English dictionaries. Because the definitions in English dictionaries are not prepared to be transformed into inference rules, some preprocessing steps w...

متن کامل

Using Learning Techniques to Generate System Models for Online Testing

Today’s software systems are mostly modular and have to be changeable. However, the testing of such systems becomes difficult, especially when changes are applied after deployment. One way to passively test such a system is to check whether the observed traces are accepted by a system model. In this paper, we present a method to generate a model of the System Under Test from its test cases. We ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Applied sciences

سال: 2022

ISSN: ['2076-3417']

DOI: https://doi.org/10.3390/app12199818